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Deterministic vs Non-deterministic Graph Property Testing 

Lior Gishboliner * Asaf Shapira^ 



Abstract 



A graph property V is said to be testable if one can check if a graph is close or far from 
satisfying V using few random local inspections. Property V is said to be non-deterministically 



testable if one can supply a "certificate" to the fact that a graph satisfies V so that once the 
certificate is given its correctness can be tested. The notion of non-deterministic testing of graph 
properties was recently introduced by Lovasz and Vesztergombi [5 , who proved that (somewhat 
surprisingly) a graph property is testable if and only if it is non-deterministically testable. Their 
proof used graph limits, and so it did not supply any explicit bounds. They thus asked if one 
can obtain a proof of their result which will supply such bounds. We answer their question 
positively by proving their result using Szemeredi's regularity lemma. 

An interesting aspect of our proof is that it highlights the fact that the regularity lemma 
can be interpreted as saying that all graphs can be approximated by finitely many "template" 
graphs. 
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^ ■ 1 Introduction 

We consider properties of finite graph, where a property of graphs is simply a family of graphs closed 
under isomorphism. The main focus of our paper is the following notion of efficiently checking if 
a graph satisfies property V or is e-far from satisfying it, where a graph G is said to be e-far from 
satisfying V if one should add/delete at least en 2 edges to turn G into a graph satisfying V. 



Definition 1.1. (Testable property) A graph property V is called testable if there is an algorithm 
Tp, called a tester, that does the following: given e > and a graph G, the tester Tp samples a set 
S of q-p(e) vertices from G, checks for every i,j € S whether (i,j) £ E{G) and then accepts/rejects 
deterministically based on the subgraph of G spanned by S. The success probability ofTv should be 
at least „■ In other words, if the input G satisfies V then Tp accepts it with probability at least |, 
and if G is e-far from satisfying V then Tp rejects G with probability at least |. The function qp(e) 
is called the query complexity ofTp, and does not depend on the size of the input graph. 
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The usual definition of a property V being testable, as introduced in [9], allows for the algorithm 
to be adaptive, but as proved in [TJ [10] one can transform any tester into a tester of the form stated 
in Definition 11.11 with a very minor loss in the query complexity. Therefore we do not lose generality 
by restricting ourselves to Definition 11.11 The important point to observe about Definition 11.11 is 
that the algorithm makes its decision solely on the basis of the random inspections it makes into 
the input graph G. In other words, the decision of the algorithm is uniquely determined by the 
distribution of induced subgraphs of size q-p(e) in the input graph G. 

Starting with |9J, the problem of characterizing the testable graph properties received a lot 
of attention and by now there are several general results of this type, see [2j [6] and the recent 
surveys [H (TTJ for more results and references on graph property testing. A drawback of these 
characterizations is that they are hard to state (and use). An alternative clean characterization 
was recently obtained by Lovasz and Vesztergombi [5] . To state this characterization we need a bit 
of notation. 

A A;-colored graph on n vertices is a coloring of the edges of K n (the complete graph on n 
vertices) using k colors. Thus, a graph can be thought of as a 2-colored graph. A property of 
fc-colored graphs is again just a family of fe-colored graphs closed under isomorphism, and it is 
said to be testablqj if it is testable in the sense of Definition 11.11 A (k, m)-coloring of a graph 
G is a coloring of the edges and non-edges of G with the colors {1, . . . , k} , so that edges are 
colored by {1, ...,m} and non-edges are colored by {m + 1, ..., k}. The following is the notion of 
non-deterministic testing introduced in [5]. 

Definition 1.2. (Non-deterministically testable property) A graph property V is called non- 
deterministically testable if there are integers k, m and a property Q of k-colored graphs so that: 

1. A graph G satisfies V if and only if there is a (k,m) -coloring of G which satisfies Q. 

2. Q is testable. 

We are now ready to state the characterization of the testable graph properties that was obtained 
by Lovasz and Vesztergombi [5]. 

Theorem 1. ([5]) A property V is testable if and only if it is non-deterministically testable. 

Clearly any testable property is also non-deterministically testable, thus the interesting part of 
the above theorem is that given the fact that a property is non-deterministically testable, one can 
construct a standard tester for the property. Quoting [5], "one could say that this theorem shows 
that "P=NP" for property testing in dense graphs". We refer the reader to [3] for several nice 
illustrations showing how to apply Theorem [TJ 



1 We define a fc-colored graph to be e-far from satisfying a property Q of fc-colored graphs if one should modify 
the colors of at least en 2 edges in order to turn G into a fc-colored graph satisfying V. 



The proof of Theorem [T] in [5] used the machinery of graph limits. Hence, the proof was not 
explicit, that is, given the fact that a property V is non-deterministically testable (in the sense 
of Definition II, 2p , it proved the existence of a standard testing algorithm for V (in the sense of 
Definition II. ip but it did not supply any upper bound for the query complexity of the new tester 
(i.e. the function q-p(e) in Definition |1.1[) . Lovasz and Vesztergombi [5 J thus asked if Theorem Q] 
can be proved using explicit arguments that will give an effective bound. Our main result in this 
paper gives a positive answer to their question. 

Our new proof of Theorem [1] uses several tools related to Szemeredi's regularity lemma [12]. In 
Section [2] we give the necessary background for applying this lemma, state some previous results as 
well as some preliminary lemmas that will be used in our new proof of Theorem [TJ As the proofs of 
these technical lemmas are somewhat routine we differ them to Section |U The proof of Theorem Q] 
appears in Section [3] As our proof applies the regularity lemma, although the bounds it supplies 
for q-p{e) are explicit, they are rather weak ones, given by Tower-type function of e. Therefore, we 
will not keep track of the exact dependence of q-p(e) on e. Finally, as we mentioned in the abstract, 
we believe that our proof gives a nice illustration of the fact that the regularity lemma implies that 
all graph can be "approximated" using only a finitely many template graphs. In fact, this intuition 
is the main idea behind the proof. 

2 Tools and Preliminary Lemmas 

As mentioned earlier, our proof of Theorem Q] will apply various tools related to Szemeredi's regu- 
larity lemma [12] . We will start with the basic definitions, then state some previous results that we 
will use (Theorem [2] and Lemmas 12.41 and \2.8\i and then state some technical lemmas that we will 
need for the proof (Lemmas 12.131 12.141 and 12. 16p . The proofs of these technical lemmas appear in 
Section^ Here, and throughout the paper, when we write x = y±z we mean that y — z<x< y + z. 
Given two disjoint vertex sets U, V we use E(U,V) to denote the set of edges connecting U to 
V and set d(U,V) = \E(U,V)\/\U\\V\ to be the density of the bipartite graph between U and V. 
The basic notion of a regular pair is the following. 

Definition 2.1. (Regular pair) Suppose U, V are disjoint vertex sets in a graph and let 7 G (0, 1). 
The pair (U, V) is said to be ^-regular if for every two subsets U' C U , V' C V satisfying \U'\ > ^\U\, 
\V'\ > 7[V| the inequality \d(U,V) - d(U',V')\ < 7 holds. 

A 7-regular pair can/should be thought of as behaving almost like a random bipartite graph 
of the same density. A partition Vi,...,V r of the vertex set of a graph is called an equipartition if 
I \Vi\ — \Vj\\ < 1 for every 1 < i < j < r. The order of a partition V\, ..., V r is the number of parts 
in it (i.e. the integer r). 

Definition 2.2. (Regular equipartition) An equipartition Vi, ..., V r of the vertices of a graph is 
^/-regular if all but at most 7(2) of the pairs (Vi, Vj) are ^-regular. 



We now define a graph property of having a 7-regular equipartition with a predefined set of 
densities. 

Definition 2.3. (Regularity instance) A graph regularity instance R is given by a regularity 
parameter 7, an integer r (the order of R), a set of densities rjij where 1 < i < j < r, and a set of 
non-regular pairs R of size at most 7(2) ■ A graph G is said to satisfy R if G has an equipartition 
V\, ..., V r such that for every (i,j) £ R the pair (Vi, Vj) is ^-regular and satisfies d(Vi, Vj) = rjij. 

A key element in the proof of Theorem Q] is the following result, which follows from the main 
results of [21 [3]. It allows us to test how close a graph is to satisfying a given regularity instance. 

Theorem 2. ([21 [3]) Let R be a graph regularity instance, let £2 > ei > and let p < |. Then 
there is a tester T = 1&[R, £i, £2,p) that distinguishes graphs that are e\-close to satisfying R from 
graphs that are e^-far from satisfying R, with success probability at least 1 — p. 

Furthermore, the query-complexity ofT depends only on R, ei, 62 and p (and not on the input 
graph) and can be expressed as an explicit function of these parameters. 

We note that the arguments used in [21 [3] to prove the above result all relied heavily on the 
regularity lemma. Therefore, the bounds they give have a very poor (yet explicit) Tower-type 
dependence on the input parameters. 

The second result we will need is Corollary 3.8 from [2j. 

Lemma 2.4. ([2j) Let R be a graph regularity instance of order r, regularity parameter 7, densities 
rjij and a set of non-regular pairs R . Suppose that a graph G has an equipartition V\, ..., V r such 
that for every (i,j) £ R the pair (Vi, Vj) is -y-regularQ and satisfies d(Vi,Vj) = rj^j ± %£. Then G 
is e-close to satisfying R. 

We now turn to consider /c-colored graphs. We first generalize the definitions of a regular 
pair, regular equipartition and regularity instance, to the more general setting of /c-colored graphs. 
We start with the following notation: Suppose U, V are two disjoint vertex sets in a /c-colored 
graph. We uscl d (U,V) to denote the density of edges of color £ between U and V, that is 
d e (U, V)\ = E £ (U, V)\/\A\\B\, where E £ (U, V) is the set of edges with color I that connect U to V. 
In case there is more than one graph, we use d^U, V) to denote the density of edges colored by I 
between U, V in the /c-colored graph G. 

Definition 2.5. (Regular pair in a /c-colored graph) Suppose U, V are disjoint vertex sets in 
a k-colored graph. The pair (U,V) is ^-regular if for every two subsets U' C U, V' C V satisfying 
\U'\ > j\U\, \V'\ > y\V\, and for every !<£<k, the inequality \d\U, V) - d e (U', V')\ < 7 holds. 



2 Actually, Corollary 3.8 in [2] only needs to assume that (Vi, Vj) is (7 + : ^ ) -regular 
s Here, and throughout the paper, wi 
"x superscript £" not "a; to the power £ 



3 Here, and throughout the paper, we always use i as a superscript and never as an exponent. So x"~ should read 



Definition 2.6. (Regular equipartition in a fc-colored graph) An equipartition Vi,...,V r of 

the vertices of a k-colored graph is ^-regular if all but at most 7(2) of the pairs (Vi, Vj) are 7 -regular. 

Definition 2.7. (A;-colored regularity instance) A k-colored regularity instance R is given by 
a regularity parameter 7, an integer r (the order of R), a set of densities rjf • where 1 < % < j < r 
and 1 < £ < k, and a set of non-regular pairs R of size at most 7(2) • A k-colored graph G is said 
to satisfy R if G has an equipartition V\,...,V r such that for every (i,j) ^ R the pair (Vi, Vj) is 
■^-regular and satisfies d l (Vi, Vj) = r]f ■ for every 1 < £ < k. 

The Regularity Lemma for A;-colored graphs states that every A:-colored graph has a 7-regular 
equipartition whose order can be bounded by a function of 7 and k. It can be formulated in terms 
of regularity instances in the following way. 

Lemma 2.8. (Regularity lemma for fc-colored graphs) For every 7 > and integers t and 
k, there exists T = I J2^g| (7, t, k) so that every k-colored graph satisfies some k-colored regularity 
instance of order at least t and at most T, and regularity parameter 7. 

Note that the usual regularity lemma is the special case of the A:-colored regularity lemma with 
k = 2. The proof of the A;-colored version requires a minor adaptation of the proof of the standard 
regularity lemma. See [3] for the details. 

Having described the known results that will be used in the proof of Theorem [H we now turn 
to state the additional technical lemmas we shall rely on. We start with a lemma that allows one 
to approximate the number of copies of small A:-colored graphs in a /c-colored graph which satisfies 
a given regularity instance. 

Definition 2.9. (IC(B, W, a)) Suppose B is a k-colored graph on vertices [q] in which (i,j) is 
colored by c(i,j). Suppose W = {rjfj : 1 < i < j < q, 1 < £ < k} are densities and a : [q] — > [q] is 
a permutation. Define: 

IC(B,W,a) = l[ri^ ) ^ )) 

i<j 

Definition 2.10. (IC(B,W)) Suppose B and W are as in the previous definition. Define: 

where Aut(B) is the number of automorphisms of B, that is, the number of injections <j) : V(B) i-> 
V(B) that preserve the color of the edges. 

Definition 2.11. (IC(B,R)) Let R be a k-colored regularity instance of order r and densities 
{Vi j :i ^^<i^ r ) 1<^< k}. Let B be a k-colored graph on the vertex set [q]. For every A C [r] 
of size q put W(A) = {rjj ■ : i,j G A, 1 < £ < k}. Define: 



IC{B,R) = ±- Y. IC(B,W(A)) 



W/ AC[r],\A\=q 



Remark 2.12. It is easy to see that IC(B,W,a), IC(B,W) and IC(B,R) are quantities in [0, 1]. 

To understand Definition 12.91 consider a random fc-colored graph whose vertices are V\ U ... U V q . 
Suppose that the probability that the color of (vi,Vj) is £ is r]f ,■ (u, G Vi,Uj G Vj). Suppose 
also that |Vi| = ■■■\V q \ = n. Let B be a fixed /c-colored graph on the vertices [q] and let <r be a 
permutation of [q]. What is the expected number of g-tuples v\ € Vi,...,v q G V q which span a 
copy of B such that x>i plays the role of <r(i)? It is easy to see that this number is IC(B, W, a)n q 
where we set W = {rjf • '■ 1 < i < j < q, 1 < ^ < A;}. We show (in Lemma l4.2p that for every 5, if 
all pairs (Vi, Vj) are 7-regular for some small enough 7, then the number of such g-tuples v\, ...,Vq 
is (IC(B, W,o~) ± 5)n q . This fact demonstrates the almost random behavior of regular partitions. 
The expression IC(B, W) (in Definition 12. 10|) is used to approximate the total number of g-tuples 
Vi G V\, ...,v q G V q which span a copy of B. The expression IC(B,R) (in Definition 12. lip is used 
to approximate the number of copies of B in a graph that satisfies the regularity instance R. The 
most general result of this sort is the following lemma. 

Lemma 2.13. For any 5 > and integers k and q there are 7 = l2.13|(^> 9> ^0 an< ^ ^ = 42.13|(^> 9' ^) 
lozi/i i^e following property: For any k- colored regularity instance R of order at least t and regularity 
parameter at most 7, and for any family B of k- colored graphs on q vertices, the number of copies 
of k-colored graphs B G B in any k-colored graph on n vertices satisfying R is 



i?/ c(B 'H(«) 



The proof of Lemma 12.131 appears in Subsection 14.11 The second lemma we will need is the 
following. 

Lemma 2.14. For every 5 and integers q and k there is A = A|2.14| (^ q, k) such that the following 
holds: Let R and M be k-colored regularity instances of order r, and densities 

{Vi,j : 1 < » < i < r, 1 < £ < k} 

and 

{/4 : 1 < i < j < r, !<£<k} 



respectively. Let B be a family of k-colored graphs of order q. Suppose that [i\ ■ = tj\ ■ ± A. Th 



en 



Y,IC{B,R)-Y,IC{B,M) 

BeB BeB 



<S. (1) 



The proof of Lemma 12.141 appears in Subsection 14.21 The last ingredient we will need is the 
following lemma whose proof appears in Subsection 14.31 
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Definition 2.15. (Chopping) Let R be a graph regularity instance of order r, regularity parameter 
7, densities \iij and a set R of non-regular pairs. A (k,m)- chopping of R is any k-colored 
regularity instance R' of order r, regularity parameter 27 , non-regular set R' = R and densities 
rjfj that satisfy 

m k 

Y ^tj = I** and Y vij = 1 - in* 

1=1 £=m+l 

Lemma 2.16. For every 7 > and integers t and k, there is r \2.1Qilit, k) such that the following 
holds: Suppose R is a graph regularity instance of order at most t and regularity parameter 7, that 
R' is a (k,m)- chopping of R and that G is a graph satisfying R with at least r m jfiO y, t, k) vertices. 
Then G has a (k,m)- coloring that satisfies R' . 

3 The New Proof of Theorem [J 

Consider any e > 0. Let property Q and integers k and m be those from Definition If .2\ that is, 
so that Q is a property of /c-colored graphs and so that a graph satisfies V if and only if it has 
a (k, m)-coloring satisfying Q. Suppose Q can be tested by a tester Tq as in Definition lf.fi Let 
q = gg(|) be the query-complexity of Tq, i.e. the number of vertices that Tq samples when testing 
if a /c-colored graph satisfies Q or is |-far from satisfying it. Let B be the set of all fc-colored graphs 
B on q vertices, such that when Tq samples a fc-colored graph isomorphic to B, it accepts the 
input. Put 

t = 1%Tgl/ 12, q,k), 7 = T233l( 1 A 2 > <?,£), T = T^ 1 /2k,t,k) 

and 



ri = mml )^jgl/l2, q,k) 



ill)! 

200m 



Let I be the set of all fe-colored regularity instances of order at least t and at most T, regularity 
parameter 7 and densities from the set {0, n, 2r/, 3n, ..., 1}. Observe that all the above constants, as 
well as |/|, depend only on e, k and the properties V and Q. We now arrive at the critical definition 
of the proof: 

Definition 3.1. (Good regularity instance) A graph regularity instance R with regularity mea- 
sure 7/2 is considered good if it has a (k,m)- chopping R' that satisfies: 

1. Rl G /. 

«■ J2 B eBlC(B,R')>l 

We say that R' is a witness to the fact that R is good. We set GOOD to be the family of good 
regularity instances. 



Suppose first that the input graph has less than r ^2.16| (i'^' ^) ver tices. In this case the algorithm 
can just ask about all edges of G and then check if G satisfies property V. Since 7, T and k are all 
functions of e, V and Q, we get that so is the query complexity in this case. Hence from this point 
on we will assume that n > i WJjrf -%, T, k). 

The following are the key observations we will need for the proof. 

Claim 3.2. If G satisfies V, then G is ^-close to satisfying some R E GOOD. 

Claim 3.3. If G is e- far from satisfying V, then G is ^ -far from satisfying any R £ GOOD. 

Let us first complete the proof based on these claims. We describe a randomized algorithm that 
distinguishes between graphs satisfying V and graphs that are e-far from satisfying V, with success 
probability at least 3, and by making a number of queries that can be bounded by a function of 
e. Put p = 3 i G qq D i ■ Let G be a graph on at least r ^ig^ ,? 1 , k) vertices. In order to test G for 
property V we do the following: For every R € GOOD use 7i3(-R, f , |,p) (recall Theorem [2|) to 
test whether G is |-close to satisfying R or |-far from satisfying it. If one of these tests accepts, 
then accept the input G, otherwise reject it. If G satisfies V then by Claim [3~2l it is |-close to 
some R € GOOD, and our tester accepts it with probability at least 1 — p > 3. If G is e-far from 
satisfying V then by Claim [3731 it is |-far from satisfying any R € GOOD. Our tester accepts G 
with probability at most \GOOD\ ■ p = | and so it rejects with probability at least 3, as required. 
Finally, since all the parameters involved are given by explicit functions of e and the properties V 
and Q, we get via Theorem [2] that the number of queries made by the tester can be bounded by 
an explicit function of e. 

We now complete the proof of Theorem Q] by proving Claims 13.21 and 13.31 

Proof (of Claim [3.2ft : Suppose G satisfies V. Then there exists some (k, m)-coloring of G that 
satisfies Q. Denote this fc-colored graph by H . By Lemma [2. 81 H satisfies some fc-colored regularity 
instance R[ of order t < r <T, regularity parameter ^ and densities 

{vf,j : 1 < * < i <r, l<£<k}. 

Since H satisfies Q, we infer that Tq must accept H with probability at least 3. This means that 
when sampling q vertices from H, the probability to get a /c-colored graph isomorphic to one of the 
elements of B is at least 3 . By the choice of 7 and t via Lemma 12.131 we get that this probability 
is J2 B eB IC ( B > R i) ± 13 • Therefore 

J2 IC(B, R'i)>^- (2) 

Let Vx,...,V r be an equipartition of H which corresponds to R[. We claim that Vi,...,V r is 
also a ^-regular equipartition of G. To see this let (i,j) £ R{. For every x G Vi, y G Vj, the 
edge (x,y) is in G if and only if (x,y) is colored in H by a color I £ {!,..., m}. Therefore 



d G (Vi,Vj) = Y.T=i d H( v ^ v i)- Let w £ v i> v> £ v i such that l^'l ^ il^U^I > i\Vj\- As we 

assume that (Vi,Vj) is a ^-regular pair in H, we have \d H (U',V') — dr H (Vi,Vj)\ < jn for every 
1 < £ < k. By the triangle inequality, we have 

m 

\dG(U , ,V , )-d G (y i ,V j )\ < Y,\d e H (U',V) -c&(^i)l <^<\- 

e=i 

We thus infer that G satisfies a regularity instance R\ with order r, regularity parameter ^, a set 
of irregular pairs R[ and densities {7/y : 1 < i < j < r} where rj^j = J2b=i vfj- 

Let R 2 be the fc-colored regularity instance that is obtained from R[ by replacing each of the 
densities rjf • with the closest integer multiples of r\. Observe that we thus change each density by 
at most rj. As we chose rj < ^2.l4|(l2'(/> k), we get from Lemma 12.141 and <^j that 

Y, IC(B, R> 2 ) > £ IC(B, R[) - 1 > 1 . (3) 

Denote the densities of R' 2 by /u| •. Let R2 be the graph regularity instance of order r, regularity 
parameter ^, densities {/ijj : 1 < i < j < r}, where fiij = YlT=i lAji an d a se t of irregular pairs 
R' 2 - By Definition 12.151 R 2 is a (k, ?n)-chopping of i?2- Furthermore R' 2 G / and we get from ([3]) 
that ^sgg-TC^-B, -^2) — 2' ^y Definition 13.11 i?o is a witness to the fact that R2 is good. Finally, 
recalling that r\ < 2Q 2 Qm , we get that for every i < j we have 



\Vi,j ^i,j\ 



X>^-X>i 



t 

1=1 1=1 



^Eki-/4; 



1=1 



e (i) 2 

<mri < y2J 
~ ' ~ 200 



In other words, the densities of i?i and i?2 differ by at most 2 ^i . We now get via Lemma 127 
that G is |-close to satisfying R2. ■ 

Proof (of Claim 13.31) : We will prove that if G is l-close to satisfying some R G GOOD 
then G is e-close to satisfying "P. Suppose that G is |-close to a graph G' that satisfies some 
R G GOOD. By Definition O i? has a (Jfe, m)-chopping # such that 52 BeB IC(B,It) > \. By 
Lemma |2. 161 G' has a (/c, m)-coloring satisfying R' . Call this fc-colored graph H'. By Lemma 12.131 
the probability to get a /c-colored graph isomorphic to an element of B when sampling q vertices 
from H' is ^.Bes IC(B, R') ± -tj. Therefore this probability is at least yL If H' was §-far from 
satisfying Q this probability would have to be at most „-, because Tq would have to reject H' with 
probability at least g. So we infer that H' is |-close to satisfying Q. This means that H' can be 
turned into a A:-colored graph H" that satisfies Q by changing the colors of at most |ra 2 edges. 

Construct a graph G" by doing the following: For every x,y G V(H"), put an edge between 
x and y if (x,y) is colored by a color ^ G {1, ...,m} in i7". First, G" satisfies V because H" is a 
(fc, m)-coloring of G" which satisfies Q. Furthermore, we claim that G" is |-close to G' . Indeed, 
observe that the number of edge modifications we performed is exactly the number of pairs (x, y) 

9 



so that in one of the graphs H' , H" the color of (x, y) belonged to the set {1, . . . , to} while in the 
other it belonged to {to + 1, . . . , k}. This number is clearly bounded from above by the number of 
modifications made when turning H' to H" . Since H" and H' differ in at most |n 2 edges the same 
thus holds for G" and G', implying that G' is §-close to G" . Since G is assumed to be |-close to 
G' , we infer that G is e-close to G" . Since G" satisfies V the proof is complete. ■ 



4 Proofs of Auxiliary Lemmas 

4.1 Proof of Lemma 12.131 

We will need the following folklore result stating the a g-tuple of vertex sets that are pairwise 
regular have the correct number of copies of K q (the complete graph on q vertices). A detailed 
proof can be found in |7j. 

Lemma 4.1. For every 5 > and q there exists 7' = TTr{5,q) such that the following holds: 
Suppose V\, ..., V q are disjoint vertex sets in a graph, \Vi\ = ... = \V q \ = n, and all pairs (Vi,Vj) are 
^'-regular. Put IC(K q ; V±, ..., V q ) = \\ d(Vi,Vj). Then the number of q -tuples v% G Vx,...,v q G V q 

i<j 

that span a copy of K q is 

(IC(K q ;V 1 ,...,V q )±S)n q 

As a first step towards proving Lemma 12.131 we prove a variant of Lemma 14.11 for /c-colored 
graphs with respect to IC(B, W, a). We will then obtain similar lemmas with respect to IC(B, W) 
and IC(B, R) (recall Definitions 12.91 [2.10l and l2.1ip and then derive from them the proof of Lemma 

Lemma 4.2. For every 5 > and q there exists 7 = T3~2l(<5, q) such that the following holds: 
Suppose Vi,...,Vq are disjoint vertex sets in a k-colored graph, \V\\ = ... = \V q \ = n, and all pairs 
(Vi,Vj) are ^-regular. Put W = {d (V{,Vj) : 1 < i < j < q, 1 < £ < k}. Then for every k- 
colored graph B on the vertices [q], and for any permutation a : [q] — > [q], the number of q-tuples 
vi G Vi, ...,v q G V q which span copy a of B with Vi playing the role of o~{i) is 

(IC(B,W,a)±5)n q 

Proof: While the proof of Lemma 14.11 can be adapted to the more general setting of Lemma 14.21 
it will be easier to reduce Lemma 14.21 to Lemma 14.11 Set 7 = T|~2](<5, q) = lTT|(5, q) and suppose 
(Vi,Vj) is 7-regular for every 1 < i < j < q. We call a g-tuple V\ G Vi,...,v q G V q proper, if 
vi, ..., v q span a copy of B with vi playing the role of a(i). 

We denote by c(i,j) the color of the edge (i,j) in B. Let Eij be the set of edges connecting a 
vertex in Vi to a vertex in Vj whose color is c(a(i),a(j)). If v\ G V±, ...,v q G V q is proper, then the 
color of (vi,Vj) is c(a(i),a(j)). We see that the edges in Eij are the only edges between Vi and Vj 
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that can "participate" in a proper g-tuple. Define a g-partite graph S with vertex sets Vi, ..., V q , in 
which the edges between V, and Vj are Eij. A g-tuple v\ € Vi,...,v q E V q is proper if and only if 
it spans a copy of K q in S. So in order to prove Lemma 14.21 it is enough to show that the number 
of copies of K q in S is (IC(B,W,a) ± 5)n q . By Lemma l4~T| the number of copies of K q in S is 
{IC{K q ;V\,...,V q ) ± 5)n q where IC(K q ;Vi,...,V q ) = U d s {Vi,Vj). So to complete the proof it is 

i<j 

enough for us to show that IC(B, W, a) = IC(K q ; V±, ..., V q ). Indeed, we have 

IC(B,W,a) = Y[d c ^^(V i ,V J ) = YlwMri ■ = *[[d s (V i ,V j ) = IC(K q ;V 1 ,...,V q ) 



Lemma 4.3. For every 5 > and every q there exists 7 = T3~3](<5, q) such that the following holds: 
Suppose that Vi,...,V q are disjoint vertex sets of size n each, and all pairs (Vi,Vj) are ^-regular. 
Put W = {d e (Vi, Vj) : 1 < i < j < q, 1 < £ < k}. Then for every k-colored graph B on the vertices 
[q] , the number of copies of B which have precisely one vertex in each of the sets V\ , . . . , V q is 



(IC(B,W)±5)n 



<-! 



Proof: Set r W^J ^,q) = TO](^t> (/)• Let V\,...,V q be as in the statement, and let B be any k- 
colored graph. By Claim B~2l for any permutation a : [q] — > [q], the number of copies of B spanned 
by v\ € Vi, ..., v q € V q such that Vi plays the role of a(i) is ( IC(B, W, a) ± -, j n q . If we sum over 



all permutations a : [q] — > [q], we get Yl \IC(B,W,a) ± -^\)n q . In this summation, we count every 

copy of B exactly Aut(B) times. Thus, by dividing by Aut(B), we get that the number of copies 
of B is 

b)(?( /cw "' ff)± ^"') - {-A^mp CiB - w ' a)±s )"' 



Aut(B) 



(IC(B, W) ± 5)n q 



Lemma 4.4. For every 5 > and q there are 7 = IT^^S, q) and t = 1tt~${o~, q) such that the 
following holds: Suppose that R is a k-colored regularity instance of order at least t and regularity 
parameter at most 7. Then for every k-colored graph B on q vertices, the number of copies of B 
in any n-vertex k-colored graph satisfying R is 



77 

(IC(B,R)±5)[ 



11 



Proof: Put 

and 



r 4g 2n 



T = 1Ql(^g)=min|^,70lf^,gj| 



Let i? be a fc-colored regularity instance as in the statement, let G be an n- vertex /c-colored 
graph satisfying R and let B be any fc-colored graph on q vertices. Let Vi, ..., V r be an equipartition 
of T^(G) satisfying i?. Let C be the collection of all ^-tuples that have at most one vertex in each 
of the sets VJ. By a union bound, the number of g-tuples that have more than one vertex in one of 
the sets V, is at most 

^r(:: 2 2 M(;M(;H<" 

So \C\ > (l — 4) (")• Therefore the lemma will follow from showing that the number of g-tuples 
belonging to C which span a copy of B is (lC(B,R) ± |<5) \C\. 

Given A = {xi, ...,x ? } C {1, ...,r} let JV(>1) denote the number of g-tuples v\ £ V Xl , ■■■■,v q G V^ 
which span a copy of -B. We say that A is good if all the pairs (V Xi ,V Xj ) (1 < i < j < q) are 
7-regular. Otherwise A is called bad. If A is good we get from our choice of 7 via Lemma [4.31 that 

N{A)=(^IC(B,W(A))±\^(^) q 

where we set 

W(A) = {d e (Vi, Vj) :i,j e A, \<t<k}. 

We can thus estimate the number of g-tuples belonging to C which span a copy of B by 
£ N ^ = £ ((^W(A))± 1 -8)^y) + £ iV04) = 

4C[r], |A|=g A is good ^ ^ ' ' A is bad 



•r_i 1 <n_ g V V / / ^ ifl bad 

(lC(B,R)± 1 l s)( r )^) q + £ (N(A)-IC(B,W(A))^) 



71\9 

r 



AC[r],|A|=g A is bad 

A is bad 

Since (Vj, Vj) is 7-regular for every (i,j) $. R there are at most 7(2) pairs (Vi, Vj) which are not 



(4) 



7-regular. Therefore the number of bad sets A C {1, ...,r} is at most 
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Using the facts that < IC(B, W{A)) < 1 and < N(A) < (f ) q for every K[r], and the bound 
on the number of bad sets, we get that 



£ (n(A)-IC(B,W(A)) 



A is bad 



r 



<-¥OC 



By plugging the above inequality in (4) we get that the number of (/-tuples belonging to C which 
span a copy of B is (IC(B,R) ± |<5) (J) (f) 9 . Observe that \C\ = Q (f) 9 . Therefore the number 
of those g-tuples is i y IC{B,R) ± |5) \C\, as required. ■ 

Proof (of Lemma [233]): Put t = tgj^d, q, k) = tQ|(fc~0») 6, q), 7 = TZJ5J 6, q, k) =-^g.k~^6, 
Let R be a regularity instance of order at least t and regularity parameter at most 7 and let G be 
a ^-colored graph satisfying R. Let B £ B. By our choice of 7 and t via Lemma I4.4I the number of 
copies of B in G is (IC(B,R) ± A; ^ a '<J)("). Clearly |i3| < M 2 -', so the number of copies of graphs 
B € B in G is 



£ ((lC(B,.R)±*-(8)*) ( U )) = (Y J IC(B,R)±5 



n 
lU{B,R)±d I I 

BeB v ' y,i/ 7 \_BeB 



4.2 Proof of Lemma I2.14I 

We will derive Lemma I2.14I from the following lemma. 

Lemma 4.5. For every 5 and q there is X = fa~%(S, q) such that the following holds: Let R and 
M be k-colored regularity instances of order r, and densities {rjf „■ : 1 < i < j < r, 1 < £ < k} 
and {[if • : 1 < i < j < r, 1 < £ < k} respectively. Let B be a k-colored graph on the vertices [q\. 
Suppose that [if ■ = n\ ■ ± A. Then 

\IC{B,R)-IC(B,M)\ <5 
Proof: Put 

r 

2^>q\ 
Let R,M be /c-colored regularity instances as in the statement. Let A = {27, ...,x„} C {l,...,r}, 
and put 

Wfl(A) = fo^. : l<i<j<q,l<£<k} 

and 

WW(A) = {4 : l<i<j<g,l<^<A:}. 
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Denote the color of (i,j) in B by c(i,j). Let a : 
have 

\IC(B,W R (A),a) - IC(B,W M (A),a)\ = 



[q] be a permutation. By Definition 12.91 we 



TT v c M l)Mm 

l<i<j<g 



n ^s Mj)) 



l<i<j<q 



TT ^J?^ 



i<*<i<9 



i<j<j< g 



Opening the parentheses in the above product gives 2^' — 1 summands, all of which are multiples 
of ±A. Therefore \IC(B,W R (A),a) - IC(B,W M {A),a)\ < A2®. By Definition EH the triangle 
inequality and our choice of 7 we have 



\IC(B,W R (A)) - IC(B,W M (A))\ 



< 



-j^-^ J2 (IC(B, W R (A),a) - IC(B, W M (A),a)) 

-11x2(2) = S . 



By Definition 12.111 and the triangle inequality we have 



\IC(B,R)-IC(B,M)\ 



i- Yl ( IC ( B > W ^ A )) ~ IC ( B > W m(A))) 

V AC{1 r}, \A\=q 



<s 



as needed. 



Proof (of Lemma 12.141) : Put A = A)2,i4| (<5, Q, k) = Ax^fc ™8, q)- By the choice of A via Lemma 
131 we get that \IC(B,R) - IC(B,M)\ < k~^d for every B £ B. Since \B\ < fc© the triangle 
inequality thus gives ([JJ. ■ 



4.3 Proof of Lemma 127161 

We will derive Lemma 12.161 from the following lemma. 

Lemma 4.6. Let U,V be disjoint vertex sets in a graph satisfying \U\ = \V\ = n > n r|~g| (7, k). 
Suppose the bipartite graph (U, V) is ^-regular with d(U, V) = [i. Let {77 : 1 < £ < k} be nonnegative 
reals satisfying YlT=i w = A* an ^ J2e=m+i V ■*• — M- Then there is a (k, m)-coloring of (U, V) such 
that the resulting k-colored graph is 2^-regular and satisfies d (U, V) = rf for every 1 < I < k. 

For the proof of Lemma 14.61 we need the following standard Chernoff-type inequality: 

Lemma 4.7. Suppose Xi, ...,X m are independent Boolean random variables and P(AQ = 1) = pi. 
Let E = YT=iPi- The.nW{\Y™ =1 Xi-E\ > 5m) <2e~ 2s2m . 
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Proof (of Lemma 14. 6j ): We will show that the edges between U and V can be colored with colors 
1, ..., m in a way that satisfies the requirements. The same argument can be used to color the non- 
edges with colors m + 1, ..., k. First assume that fj, < 7. If this is the case, just color any rfn 2 of the 
edges between U and V with color £, for every 1 < £ < m. This way we made sure that d (U, V) = rf. 
Let U' C U, V C V with |£/'|, | V'| > 27n. Before the coloring we had d(C/', V') = fi± 7. Therefore 
after the coloring we have < d e (U', V') < \i + 7, so \d e (U', V) - rf \ < p, + 7 < 27. 

Assume from this point on that fj, > 7. For every edge e S £"(£/, V) roll a die with sides 1, ..., m 
so that probability of side £ is 2_. If the die falls on side £ then color e with color £. Then the 
expected number of edges of color £ is 77 n 2 . By Lemma 14.71 the probability that the number of 
edges colored by £ deviates from its expectation by more than 712 is at most 2e~ 2n '^ < 2e _2ra . If 
n > iW^jjk) then this probability is less than j^. This means, by a union bound, that with 
probability at least | the number of edges colored by £ is 77 n 2 ± n? for every 1 < £ < m. 

Claim 4.8. With probability at least | aW sets U'C(J,F'CF suc/i t/iat \U'\, \V'\ > 2jn satisfy 
d e (U', V')=r 1 e ±% 

Proof: Let U' C U,V C V such that |C7'|, |V'| > 2'jn. The density of edges between U' and V 
before the coloring is ^±7. Therefore the expected density of edges with color £ is 2- (/a ±7) = 77 ±7. 
So it is enough to show that with probability at least |, there are no sets U', V and color 1 < £ < m 
such that the density of edges of color £ between U' and V' deviates from its expectation by more 
than 2. 

By Lemma 14.71 the probability that the density of edges of color £ between U' and V' deviates 
from its expected value by more than \ is at most 2e~ 2( i )2|C7 ' l|v " l/d(c/ '' y,) < 2e-^\ v 'W Y '\l 2 . We 
assumed that \U'\, \V'\ > 2771, so this probability is at most 2e~ 7 n ' 2 . The number of choices of 
sets U', V as above is at most 2 2n , and the number of colors is at most k, so by a union bound 
we get: The probability that there are sets U', V and a color £ , such that the density of edges 
between U', V with color £ deviates from its expectation by more than ^ is at most k2 2n 2e~ 1 n ' 2 . 
This expression is less than \ if n is large enough, namely n > 7*4^(7, k). M 

Getting back to the proof of Lemma 14.61 we see that so far we proved that with probability at 
least 5 the following two conditions hold: 

1. d^(U, V) = rf ± n _ 2 for every 1 < £ < m. 

2. d e (U', V) = rf ± -^- for every 1 < £ < m and every two sets U' C U, V' C V of size at least 
2771. 

Now take a coloring that satisfies conditions 1 and 2. Let us write d^(U, V) = rf + e^ where 
e < n~2. Observe that 

mm m 

i=\ t=i e=i 
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Therefore ^ e^ = 0. We can change the colors of at most ran? < kn? edges to make sure that 

d (U,V) is exactly rj . For every U' C U, V' C V of size at least 2jn this final change changes 
d e (U',V) by at most r which is less than ^ if n > Hgjrf'y, k). So in the end we have 

d e (U' , V) = rf ± 27 as required. ■ 

Remark 4.9. In fact, we could have proved the following stronger lemma: Let U, V be disjoint 
sets in a graph, \U\ = \V\ = n > T Tgjrf 'y, e, k). Suppose U,V is ^-regular and d(A,B) = fi, and let 
{n : 1 < t < k} be nonnegative numbers satisfying Ylh=i V M an d Yli= m +i V ^ ~ A 4 ■ Then 
there is a (k,m) -coloring of U,V such that the resulting k-colored graph is 7(1 + e)-regular and 
satisfies d (U, V) = r\ . The choice of '27 in Lemma \4-6\ and in Definition \2.15\ is for convenience. 

Proof (of Lemma 12. 16|) : Put i W^f y,t, k) = t ■ r ^^ y, k). Let R be a graph regularity 
instance of order r < t and regularity parameter 7. Let R' be a (k, m)-chopping of R' and let 
{vfj '• 1 ^ i < 3 ^ r ) 1 < ^ < A;} be the densities of R' . Let G be a graph with at least ^2.16| (^' *' ^) 
vertices that satisfies R. Let V%, ..., V r be a 7-regular equipartition of V(G) that corresponds to R. 
For every 1 < i < r we have |V£| > *^--*-^T — > n$~fi( 'y, k). If (i,j) $. R apply Lemma EL"6l for Vi, Vj 
and {77* •, ...,77^ •}. Color the rest of the edges and non-edges arbitrarily. The resulting /c-colored 
graph satisfies R'. ■ 
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